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Production of antibodies using gene libraries* 



Description 

Background of the Invention 

5 Monoclonal and polyclonal antibodies are useful for 

a variety of purposes. The precise antigen specificity 
of antibodies makes them powerful tools that can be used 
for the detection, quantitation, purification and 
neutralisation of antigens. 

10 Polyclonal antibodies are produced in vivo by 

immunizing animals, such as rabbits and goats, vith 
antigens, bleeding the animals and isolating polyclonal 
antibody molecules from the blood. Monoclonal antibodies 
are produced by hybridoma cells, which are made by 

15 fusing, in vitro » immortal plasmacytoma cells with 

antibody producing cells (Kohler, G. and C. Milstein, 
Nature, 256 :495 (1975)) obtained from animals immunized 
in vivo with antigen. 

Current methods for producing polyclonal and mono- 

20 clonal antibodies are limited by several factors. First, 
methods for producing either polyclonal or monoclonal 
antibodies require an in vivo immunization step. This 
can be time consuming and require large amounts of 
antigen. Second, the repertoire of antibodies expressed 

25 iti yiyp is restricted by physiological processes, such as 
those which mediate self -tolerance that disable auto- 
reactive B cells (Goodnow, C.C., et , al. . Nature, 334 :676 
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(1988); Goodnov, J.W., Basic and Clinical I m munolog y, Ed. 
5,, Los Altos, CA, Large Medical Publications (1984); 
Young, C.R., Molecular Immunology , New York, Marcel 
Dekker (1984)), Third, although antibodies can exist in 

05 millions of different forms, each with its own unique 
binding site for antigen, antibody diversity is 
restricted by genetic mechanisms for generating antibody 
diversity (Honjo, T. , Ann. Rev. Immunol. , 1:499 (1983); 
Tonegawa, S. f Nature:302:575 (1983)). Fourth, not all 

10 the antibody molecules which can be generated will be 

generated in a given animal. As a result, raising high 
affinity antibodies to a given antigen can be very time 
consuming and can often fail. Fifth, the production of 
human antibodies of desired specificity is very 

15 problematical. 

A method of producing antibodies which avoids the 
limitations of presently -available methods, such as the 
requirement for immunization of an animal and in vivo 
steps, would be very useful, particularly if it made it 

20 possible to produce a wider range of antibody types than 
can be made using presently-available techniques and if 
it made it possible to produce human antibody types. 

Disclosure of the Invention 

The present invention relates to a method of produc- 

25 ing libraries of genes encoding antigen-combining 

molecules or antibodies; a method of producing antigen- 
combining molecules, also referred to as antibodies, 
which does not require an i n vivo procedure, as is 
required by presently-available methods; a method of 

30 obtaining antigen-combining molecules (antibodies) of 
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selected or defined specificity which does not require an . 
in vivo procedure; vectors useful in the present method 
and antibodies produced or obtained by the method. 

The present invention relates to an in vitro process 

05 for synthesizing DNA encoding families of antigen* 

combining molecules or proteins. In this process, DNA 
containing genes encoding antigen- combining molecules is 
obtained and combined vith oligonucleotides which are 
homologous to regions of the genes which are conserved. 

10 Sequence-specific gene amplification is then carried out 
using the DNA containing genes encoding antigen-combining 
proteins as template and the homologous oligonucleotides 
as primers. 

This invention also relates to a method of creating 

15 diverse libraries of DNAs encoding families of antigen- 
combining proteins by cloning the product of the in § vitro 
process for synthesizing DNA, described in the preceeding 
paragraph, into an appropriate vector (e.g., a plasmid, 
viral or retroviral vector). 

20 The subject invention provides an alternative method 

for the production of antigen-combining molecules, which 
are useful affinity reagents for the detection and 
neutralisation of antigens and the delivery of molecules 
to antigenic sites. The claimed method differs from 

25 production of polyclonal antibody molecules derived by 

immunization of live animals and from production of mono- 
clonal antibody molecules through the use of hybridoma 
cell lines in that it does not require an ip.yivo 
immunization step, as do presently available methods. 

30 Rather, diverse libraries of genes which encode antigen- 
combining sites comprising a significant proportion of an 
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animal's repertoire of antibody combining sites are made, 
as described in detail herein. These genes are expressed 
in living cells, from which molecules of desired 
antigenic selectivity can be isolated and purified for 

05 various uses. 

Antigen-combining molecules are produced by the 
present method in the following manner, vhich is 
described in greater deail below. Initially, a library 
of antibody genes which includes a set of variable 

10 regions encoding a large, diverse and random group of 
specificities derived from animal or human immunoglob- 
ulins is produced by amplifying or cloning diverse 
genomic fragments or cDNAs of antibody mRNAs found in 
antibody-producing tissue. 

!5 In an optional step, the diversity of the resulting 

libraries can be increased by means of random muta- 
genesis. The gene libraries are introduced into cultured 
host cells, which may be eukaryotic or prokaryotic, in 
which they are expressed. Genes encoding antibodies of 

20 desired antigenic specificity are identified, using a 

method described herein or known techniques, isolated and 
expressed in quantities in appropriate host cells, from 
which the encoded antibody can be purified. 

Specifically, a library of genes encoding 

25 immunoglobulin heavy chain regions and a library of genes 
encoding immunoglobulin light chain regions are con- 
structed. This is carried out by obtaining antibody- 
encoding SNA,' which is either genomic fragments or cDNAs 
of antibody mRNAs, amplfying or cloning the fragments or 

30 cDNAs; and introducing them into a standard framework 
antibody gene vector, which is used to introduce the 
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antibody-encoding DNA into cells in which the DNA is 
expressed* The vector includes a framework gene encoding 
a protein, such as a gene encoding an antibody heavy 
chain or an antibody light chain which can be of any 

05 origin (human, non-human) and can be derived from any of 
a nunber of existing DNAs encoding heavy chain immuno- 
globulins or light chain immunoglobulins. Such vectors 
are also a subject of the present invention and are 
described in greater detail in a subsequent section. 

10 Genes from one or both of the libraries are introduced 
into appropriate host cells, in which the genes are 
expressed, resulting in production of a wide variety of 
antigen-combining molecules. 

Genes encoding antigen-combining molecules of 

15 desired specificity are Identified by identifying cells 
producing antigen-combining molecules which react with a 
selected antigen and then obtaining the genes of 
interest. The genes of interest can subsequently be 
introduced into an appropriate host cell (or can be 

20 further modified and then introduced into an appropriate 
host cell) for further production of antigen-combining 
molecules, which can be purified and used for the same 
purposes* for which conventionally-produced antibodies are 
used. 

25 Through use of the method described, it is possible 

to produce antigen-combining molecules which are of wider 
diversity than are antibodies available as a result of 
known methods; novel antigen-combining molecules with a 
diverse range of specificities and affinities and 

30 antigen-combining molecules which are predominantly human 
in origin. Such antigen-combining molecules are a 
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subject of the present invention and can be used 
clinically for diagnostic, therapeutic and prophylactic 
purposes, as veil as in research contexts, and for other 
purposes. 

05 Brief Description of the Drawings 

Figure 1 is a schematic representation of the method 

of the present invention by which antigen- combining 

molecules, or antibodies, are produced. 

Figure 2 is a schematic representation of amplifica- 
10 tion or cloning of IgM heavy chain variable region DNA 

from mRNA, using the polymerase chain reaction. 

P anel A shows the relevant regions of the poly adenylated 

mRNA encoding the secreted form of the IgM heavy chain. 

S denotes the sequences encoding the signal peptide which 
15 causes the nascent peptide to cross the plasma membrane. 

V, D and J together comprise the variable region. C R 1 , 

C u 2, and C„3 are the three constant domains of C/i. Hinge 

encodes the hinge region. C, B and Z are oligonucleotide 

PGR primers (discussed below) . 

20 Pan el B shows the reverse transcript DNA product of the 
mRNA prfmed by oligonucleotide Z, with the addition of 
poly-dC by terminal transferase at the 3' end. 
Panel C is a schematic representation of the annealing of 
primer A to the reverse transcript DNA. 

25 Panel D shows the final double stranded DNA PCR product 
made utilizing primers A and B. 

Panel E shows the product of PCR annealed to primer C. 
Panel F is a blowup of Panel E, showing in greater detail 
the structure of primer C. Primer C consists of two 
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Parts: a 3' part complementary to IgM heavy chain mRNA 
•s shown, and a 5' part which contains restriction site 
RE2 and spacer. 

l5£el_G shows the final double stranded DNA PCR product 
made utilizing primers A and C and the product of the 
previous PGR (depicted in D) as template. The S , V, D J 
regions are again depicted. 

Figure 3 is « schematic representation of the heavy 
chain framework vector pFHC. The circular plasmid 
(above) is depicted linearized (below) and its relevant 
components are shown: animal cell antibiotic resistance 
marker; bacterial replication origin; bacterial cell 
antibiotic resistance marker; Cp enhancer; LTR containing 
the viral promoter from the Moloney MLV retrovirus DNA • 
PGR primer (D) ; cDNA cloning site containing restriction 
endonuclease sites, RE1 and RE2 , separated by spacer DNA ; 
CP exons; and poly A addition and termination sequences 
derived from the C M gene or having the same sequence as 
the Cft gene . 

Figure 4 depicts a nucleotide sequence of the C 1 
•xon of the C„ gene, and its encoded amino acid sequence 
(Panel A)., The nucleotide coordinate numbers are listed 
above the line of nucleotide sequences. Panel B depicts 
the N-doped sequence, as defined in the text. 

25 Detailed Description of the Invent-^ 

The present invention provides a method of producing 
antigen-combining molecules (or antibodies) which does 
not require an invivo immunization procedure and which 
makes it possible to produce antigen- combining molecules 
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with far greater diversity than is shown by antibodies 
produced by currently. available techniques. 

The present invention relates to a method of 
producing libraries of genes encoding antigen-combining 
molecules (antibody proteins) vith diverse 
antigen-combining specificities; a method of producing 
such antigen- combining molecules, antigen- combining 
molecules produced by the method and vectors useful in 
the method. The following is a description of generation 
of such libraries, of the present method of producing 
antigen-combining molecules of selected specificity and 
of vectors useful in producing antigen-combining 
molecules of the present invention. 

As described below, the process makes use of 
techniques which are known to those of skill in the art 
and can be applied as described herein to produce and 
identify antigen-combining molecules of desired antigenic 
specificity: the polymerase chain reaction (PCR) . to 
amplify and clone diverse cDNAs encoding antibody mRNAs 
found in antibody-producing tissue; mutagenesis protocols 
to further increase the diversity of these cDNAs ; gene 
transfer protocols to introduce antibody genes into 
cultured Tprokaryotic and eukaryotic) cells for the 
purpose of expressing them; and screening protocols to 
detect genes encoding antibodies of the desired antigenic 
specificity. A general outline of the present method is 
represented in Figure 1. 



WO 91/10737 



PCT/US91/00209 



-9- 



05 



10 



15 



20 



25 



30 



. Construction of T.*h y « ry of Gene.. ^^ 
Anti£en»Ca mbintn pj Molecules 

A key «tep in the production of antigen-combining 
molecules by the present method is the construction of a 
-library- of antibody genes which include -variable- 
regions encoding a large, diverse, but random set of 
specificities. The library can be of human or non-human 
origin and is constructed as follows: 

Initially, genomic DNA encoding antibodies or cDNAs 
of antibody mRNA (referred to as antibody-encoding DNA) 
is obtained. This DNA can be obtained from any source of 
antibody- producing cells, such as spleen cells, 
peripheral blood cells, lymph nodes, inflammatory tissue 
cells and bone marrow cells. It can also be obtained 
from a genomic library or cDNA library of B cells. The 
antibody-producing cells can be of human or non-human 
origin; genomic DNA or mRNA can be obtained directly from 
the tissue (i.e.. without previous treatment to remove 
cells which do not produce antibody) or can be obtained 
after the tissue has been treated to increase 
concentration of antibody-producing cells or to select a 
particular type(s) of antibody-producing cells (i.e.. 
treated to^ enrich the content of antibody -producing 
cells). Antibodyproduclng cells can be stimulated by an 
agent which stimulates antibody mRNA production (e.g., 
lipopolysaccharide) before DNA is obtained. 

Antibody-encoding DNA is amplified and cloned using 
a known technique, such as the PCR using appropriately- 
selected primers, in order to produce sufficient quanti- 
ties of the DNA and to modify the DNA in such a manner 
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(e.g.. by addition of appropriate restriction sites) that 
it can be introduced as an insert into an E. coli cloning 
vector. This cloning vector can serve as the expression 
vector or the inserts can later be introduced into an 
expression vector, such as the framework antibody gene 
vector described below, amplified and cloned DNA can be 
further diversified, using mutagenesis, such as PCR, in 
order to produce a greater diversity or wider repertoire 
of antigen-binding molecules, as well as novel antigen- 
binding molecules. 

Cloned antibody-encoding DNA is introduced into an 
expression vector, such as the framework antibody gene 
vector of the present invention, which can be a plasmid. 
viral or retroviral vector. Cloned antibody. encoding DNA 
is inserted into the vector in such a manner that the 
cloned DNA will be expressed as protein in appropriate 
host cells. It is essential that the expression vector 
used make it possible for the DNA insert to be expressed 
as a protein in the host cell. One expression vector 
useful in the ^present method is referred to as the 
framework antibody gene vector. Vectors useful in the 
present method contain antibody constant region or 
portions ^hereof in such a manner that when amplified DNA 
is inserted, the vector expresses a chimeric gene product 
comprising a variable region and a constant region in 
proper register. The two regions present in the chimeric 
gene product can be from the same type of immunoglobulin 
molecule or from two different types of immunoglobulin 
solecules . 

These libraries of antibody-encoding genes are then 
expressed in cultured cells, which can be eukaryotic or 
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prokaryotic. The libraries can be introduced into host 
cells separately or together. Introduction of the 
antibody-encoding DNA In vitro into host cells (by 
infection, transformation or transf ection) is carried out 
using known techniques, such as electroporation, 
protoplast fusion or calcium phosphate co-precipitation. 
If only one library is introduced into a host cell, the 
host cell vill generally be one which makes the other 
antibody chain, thus making it possible to produce 
complete/functional antigen-binding molecules. For 
example, if a heavy chain library produced by the present 
method is introduced into host cells, the host cells will 
generally be cultured cells, such as myeloma cells or E^ 
coli, which naturally produce the other (i.e., light) 
chain of the immunoglobulin or are engineered to do so. 
Alternatively, both libraries can be introduced into 
appropriate host cells, either simultaneously or 
sequentially. 

Host cells in which the antibody-encoding DNA is 
expressed can be eukaryotic or prokaryotic. They can be 
immortalized cultured animal cells, such as a myeloma 
cell line which has been shown to efficiently express and 
secrete introduced immunoglobulin genes (Morrison, S.L., 
et_al., Ann. N.Y, Acad^^Scl^, 507:187 (1987); Kohler, G . 
and C. Hilstein, Eur. J. Imnrunol.. 6:511 (1976); Oi, 
V.T., et_al . , Immunoglobulln_Cene ExpressioT^j^ 
Iiansf orm ed Lymphoid Cells. £0:825 (1983); Davis, A.C. 
and M.J. Shulman, Immuno 1 ^_Today , 10:119 (1989)). One 
host cell which can be used to express the antibody- 
encoding DNA is the J558L cell line or the SP2/0 cell 
line. 
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Cells expressing antigen-combining molecules with a 
desired specificity for a given antigen can then be 
selected by a variety of means, such as testing for 
reactivity vith a selected antigen using nitrocellulose 
layering. The antibodies identified thereby can be of 
human origin, nonhuman origin or a combination of both. 
That is. all or some of the components (e.g.. heavy 
chain, light chain, variable regions, constant regions) 
can be encoded by DNA of human or nonhuman origin, which, 
vhen expressed produces the encoded chimeric protein 
vhich, in turn, may be human, nonhuman or a combination 
cf both. In such antigen-combining molecules, all or 
some of the regions (e.g., heavy and light chain variable 
and constant regions) are referred to as being of human 
origin or of nonhuman origin, based on the source of the 
DNA encoding the antigen-combining molecule region in 
question. For example, in the case in which DNA encoding 
mouse heavy chain variable region is expressed in host 
cells, the resulting antigen-combining molecule has a 
heavy chain variable region of mouse origin. Antibodies 
produced may be used for such purposes as drug delivery, 
tumor imaging and other therapeutic, diagnostic and 
prophylactic uses. 

■it ' 

Once antibodies of a desired binding specificity are 
obtained, their genes may be isolated and further 
mutagenized to create additional antigen combining 
diversity or antibodies of higher affinity for antigen. 



WO 91/10737 



PCT/US91/00209 



•13- 



05 



10 



25 



30 



Cgnsjaaictlotw^^ Ge ne Llbyary 

The following is a detailed description of a" 
specific experimental protocol which embodies the 
concepts described above. Although the following is a 
description of one particular embodiment, the same 
procedures can be used to produce libraries in which the 
immunoglobulin and the heavy chain class are different or 
In which light chain genes are amplified and cloned. The 
present invention is not intended to be limited to this 
example. In the embodiment presented below, a diverse 
heavy chain gene library is constructed. Using the 
principles described in relation to the heavy chain gene 
library, a diverse light chain gene library is also 
15 constructed. These are co-expressed in an immortal tumor 
cell capable of producing antibodies, such as plasma- 
cytoma cells or myeloma cells. Cells expressing antibody 
reactive to antigen are identified by a nitrocellulose 
filter overlay and antibody is prepared from cells 
Identified as expressing it. As described in a subse- 
quent section, there are alternative methods of library 
construction, other expression systems which can be used 
and alternative selection systems for identifying anti- 
bo dy-produjting cells or viruses. 

Step 1 m this specific protocol is construction of 
libraries of genes in E. coli which encode immunoglobulin 
heavy chains. This is followed by the use of random 
mutagenesis to increase the diversity of the library, 
which is an optional procedure. Step 2 is introduction 
of the library, by transf ection , into myeloma cells. 
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Step 3 is identification of myeloma cells expressing 
antibody with the desired specificity, using the 
nitrocellulose filter overlay technique or techniques 
known to those of skill in the art. Step 4 is isolation 
05 of the gene(s) encoding the antibody with the desired 
specificity and their expression in appropriate host 
cells , to produce antigen-combining fragments useful for 
a variety of purposes. 

Constructio n 

10 One key step in construction of the library of cDNAs 

encoding the variable region of mouse heavy chain genes 
is construction of an E^coli* plasmld vector, designated 
pFHC. pFHC contains a "framework" gene, which can be 
any antibody heavy chain and serves as a site into which 

15 the amplified cloned gene product (genomic DNA or cDNA of 
antibody mRNAs) is introduced. pFHC is useful as a 
vector for this purpose because it contains RE1 and RE2 
cloning sites. Other vectors which include a framework 
gene and other cloning sites can be used for this purpose 

20 as well. The framework gene includes a transcriptional 
promoter (e.g., a powerful promoter, such as a Moloney 
LTR (Mulligan, R.C., In Experimental_Manl£ulatipn, of Gene 
Expression. New York Adacemic Press, p. 155 (1983)) and a 
Cp chain transcriptional enhancer to increase the level 

25 of transcriptions from the promoter (Gillies, S.D., et 
££ii» 21:717 (1*83), a cloning site containing RE1 
and RE2; part of the Cp heavy chain gene encoding 
secreted protein; and poly A addition and termination 
sequences (Figure 3). The framework antibody gene vector 

30 of the present invention (pFHC) also includes a 
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selectable Barker (e.g., an antibiotic resistance gene 
such as the neomycin resistance gene, neo R ) for animal 
cells; sequences for bacterial replication (ori); and a 
selectable marker (e.g., the ampicillin resistance gene, 
Amp ) for bacterial cells. The framework gene can be of 
any origin (human, non - human ) , and can derive from any 
one of a number of existing DNAs encoding heavy chain 
immunoglobulins (Tucker, P.W., et al. , Science, 206:1299 
(1979); Honjo, T. , et_al.. Cell, 18:559 (1979); Bothwell, 
A.L.M., at .al. , Cell, 24:625 (1981); Liu, A.Y, et al. . 
Gene. 54:33 (1987); Kawakami, T. . et al . . Hue. Acids. 
Silt' 1:3933 (1980)). In this embodiment, the vector 
retains the introns between the C^l , hinge, C H 2 and C H 3 
exons. The "variable region" of the gene, which includes 
the V, D and J regions of the antibody heavy chain and 
which encodes the antigen binding site, is deleted and 
replaced with two consecutive restriction endOnuclease 
cloning sites, RE1 and RE2 . The restriction endonuclease 
site RE1 occurs Just 3' to the LTR promoter and the 
restriction endonuclease site RE2 occurs within the 
constant region Just 3' to the J region (see Figure 3). 

Another key step in the production of antigen- 
combining' molecules in this embodiment of the present 
invention is construction in an E A _coll vector of a 
25 library of cDNAs encoding the variable region of mouse 
immunoglobulin genes. In this embodiment, the pFHC 
vector, which includes cloning sites designated RE1 and 
RE2, is used for cloning heavy chain variable regions, 
although any cloning vector with cloning sites having the 
same or similar characteristics (described below) can be 
used. Similarly, a light chain vector can be designed, 
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using the above described procedures and procedures known 
to a person of ordinary skill in the art. 

In this embodiment, non- immune mouse spleens are 
used as the starting material. mRNA is prepared directly 
from the spleen or from spleen processed in such a manner 
that it is enriched for resting B cells. Enrichment of 
tissue results in a more uniform representation of 
antibody diversity in the starting materials. 
Lymphocytes can be purified from spleen using ficoll 
10 gradients (Boyum, A., Scand. J. of Clinic al Invest. , 
21:77 (1968)). B cells are separated from other cells 
(e.g., T cells) by panning with anti-IgM coated dishes 
(Wysocki, L.J. and V.L. Sato, Proc. Natl. Acad. Sei. , 
75:2844 (1978)). Because activated cells express the 
1L-2 receptor but resting B cells do not, resting B cells 
can be separated yet further from activated cells by 
panning. Further purification by size fractionation on a 
Cell Sorter results in a fairly homogeneous population of 
resting B cells. 

Poly A+ mRNA from total mouse spleen is prepared 
according to published methods (Sambrook, J., et_al. , 

Molecular ^loningj A Laboratory Manual , 2d Ed., Cold 

Spring Harbor Laboratory Press, Cold Spring Harbor, NY 
(1989)). Production of antibody mRNA can first be 
25 stimulated by lipopolysaccharide (LPS) (Andersson, J. A., 
J. Exp. Mgd^, 145:1511 (1977)). First strand 
cDNA is prepared to this mRNA population using as primer 
an oligonucleotide, Z, which is complementary to Cp in 
the C H 1 region 3* to J . This primer is designated Z in 
Figure 2. First strand cDNA is then elongated by the 
terminal transferase reaction with dCTP to form a poly dC 



15 



20 



30 



WO 91/10737 



PCT/US91/00209 



•17- 



05 



15 



tail (Sambrook, J., et §1. , Molecular Clonlngj A 
Laboratory Manua l, 2d Ed., Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, NY (1989)). 

This DNA product is then used as template in a 
polymerase chain reaction (PCR) to amplify cDNAs encoding 
antibody variable regions (Saiki, R.K., et al., Science. 
239:487 (1988); Ohara. 0., et al . , Proc. ult l. Acad^JIj 
21A, 86:5673 (1989)). Initially, PCR is carried out with 
two primers: primer A and primer B , as represented in 
10 Figure 2. Primer A contains the RE1 site at its 5» end. 
followed by poly dG. Primer B is complementary to the 
constant (C H 1) region of the Cp gene, 3' to the J region 
and 5' to primer Z (see Figure 2). Primer B is 
complementary to all C/i genes, which encode the heavy 
chain of molecules of the IgM class, the Ig class 
expressed by all B cell clones prior to class switching 
(Schimizu, A. and T. Honjo, Cell, 36:801-803 U°84)) and 
present in resting B cells. The resultant PCR product 
includes a significant proportion of cDNAs encompassing 
the various V R regions expressed as IgM in the mouse. 
(The use of other primers complementary to the cDNA genes 
encoding the constant regions of other immunoglobulin 
heavy chains can be used in parallel reactions to obtain 
the variable regions expressed on these molecules, but 
for simplicity these are not described). 

Next, the product of the first PCR procedure is used 
again for PCR with primer A and primer C. Primer C. like 
primer B, is complementary to the Cp gene 3 » to J and 
just 5' to primer B (see Figure 2). Primer C contains 
30 the RE2 site at its 5' end. The RE2 sequence is chosen 
in such a manner that when it is incorporated into the 
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framework vector, no alteration of coding sequence of the 
C/. chain occurs (See Figures 2 and 3). This method of 
amplifying C M cDNAs , referred to .s unidirectional nested 
PGR, incorporates the idea of nested primers for cloning 
05 a gene when the nucleotide sequence of only one region of 
the gene is known (Ohara. 0., «t_al. , Proc. Natl. a«.« 
lel^SSA. 86:5673 (1989)). The PGR product is then ~~ 
cleaved with restriction enzymes RE1 and RE2 and cloned 
into the RE1 and RE2 sites of the pPHC vector (described 
10 below). The sequence of primers end of RE1 and RE2 sites 
«re selected so that when the PGR product is cloned into 
these sites, the sites are recreated and the cloned 
antibody gene fragments are brought back into the proper 
frame with respect to the framework immunoglobulin gene 
15 present in pF HC. This results in creation of a C„ 

-inigene which lacks the intron normally present between 

" d tbe C H 1 re 6 ion °f C„ (See Figure 3). These 
procedures result in production of the heavy chain 
library used to produce antigen-binding molecules of the 
20 present invention, as described further below. 

Optionally, diversity of the heavy chain variable 
region is increased by random mutagenesis, using 

to those of skill in the art. 
For example, the library produced as described above 
15 is amplified again, using PGR under conditions of 

limiting nucleotide concentration. Such conditions are 
known to increase the infidelity of the polymerisation 
and result in production of mutant products. Primers 
useful for this reaction are Primers C and D as 
0 represented in Figures 2 and 3. Primer D derives from 
PFHC Just 5' to RE1. The PGR product, after cleavage 
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with RE1 and RE2 , is recloned into the framework vector 
pFHC. To the extent that nutation affects codons of the 
antigen binding region* this procedure increases the 
diversity of the binding domains. For example, if the 

05 starter library has a complexity of 10* elements, and an 
average of one mutation is introduced per complementarity 
determining region, and it is assumed that the 
complementarity determining region is 40 amino acids in 
size and that any of six amino acid substitutions can 

10 occur at a mutated codon, the diversity of the library 

can be increased by a factor of about 40 x 6, or 240, for 

single amino acid changes and 240 x 240, or about 
4 

6 x 10 , for double amino acid changes, yielding a final 
diversity of approximately 10 11 . This is considered to 

15 be in the range of the diversity of antibodies which 

animals produce (Tonegawa, S., Nature , 302: 575 (1983)). 
Even greater diversity can be generated by the random 
combination of H and L chains, the result of co-expres- 
sion in host cells (see below). It is, thus, theoreti- 

20 cally possible to generate a more diverse antibody 

library in vitro than can be generated in v ivo. This 
library of genes is called the "high diversity" heavy 
chain library. It may be propagated indefinitely in E^ 
££li- A high diversity light chain library can be 

25 prepared similarly. 

The framework vector for the light chain library, 
designated pFLC, includes components similar to those in 
the vector for the heavy chain library: the enhancer, 
promoter, a bacterial selectable marker, an animal 

30 selectable marker, bacterial origin of replication and 
light chain exons encoding the constant regions. For 
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pFLC, the animal selectable marker should differ from the 

animal selectable in pFHC. For example, if pFHC contains 

R. 

neo t pFLC can contain Eco gpt. 

A light chain library, which contains diverse light 

05 chain fragments, is prepared as described above for 

construction of the heavy chain library. In constructing 
the light chain library, the primers used are different 
from those described above for heavy chain library 
construction* In this instance, the primers are 

10 complementary to light chain mRNA encoding constant 

regions. The framework vector contains the light chain 
constant region axons . 

Intro duction^of the Library of Immunoglobulin Chain Genes 
into Immo rtalized Anim al Ce lls 
15 The library of immunoglobulin chain genes produced 

as described is subsequently introduced into a line of 
immortalized cultured animal cells, referred to as the 
•host" cells, in which the genes in the library are 
expressed. Particularly useful for this purpose are 
20 plasmacytoma cell lines or myeloma cell lines which have 
been shown to efficiently express and secrete introduced 
immunoglobulin genes (Morrison, S.L., et_al. , Ann. N.Y. 
Acad^_Sci. , 507:187 (1987); Kohler, G. and C, Milstein, 
Eur. J. Immunol.. 6:511 (1976); Galfre and C. Milstein, 
25 Methods Enzymol. . 73:3 (1981); Davis, A.C. and M.J. 

Shulman, Immuno l . Today . 10:119 (1989)). For example, 
the J558L cell line can be cotransf ected using electro- 
poration or protoplast fusion (Morrison, S.L., e t .al. , 
Ann, iH H,Y. Acad. Sci. . 507:187 (1987)) and transfected 
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cells selected on the basis of auxotrophic markers 
present on light and heavy chain libraries. 

As a result of cotransf ormation and selection for 
markers on both light chain and heavy chain vectors, most 

05 transformed host cells vill express several copies of 
immunoglobulin heavy and light chains from the diverse 
library, and will express chimeric antibodies (antibodies 
encoded by all or part of two or more genes) (Nisonoff , 
A., et al. t In The Ant i body Molecule . Academic Press, NY 

10 p. 238 (1975)). These chimeric antibodies are of two 
types: those in which one chain is encoded by a host 
cell gene and the other chain is encoded by an exogen- 
ously introduced antibody gene and those in which both 
the light and the heavy chain are encoded by an exogenous 

15 antibody gene. Both types of antibodies will be 

secreted. A library of cells producing antibodies of 
diverse specificities is produced as a result. The 
library of cells can be stored and maintained in- 
definitely by continuous culture and/or by freezing. A . 

20 virtually unlimited number of cells can be obtained by 
this process. 

Isolation, of Cells Producing Ant ifien^Binding_!^ecules_of 
Selected Specif icity 

25 Cells producing antigen-binding molecules of 

selected specificity (i.e., which bind to a selected 
antigen) can be identified and isolated using 
nitrocellulose filter layering or known techniques. The 
same methods employed to identify and isolate hybridoma 

30 cells producing a desired antibody can be used: cells 
are pooled and the supernatants tested for reactivity 
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with antigen (Harlow, E. and D . Lane, Antibodie s : A 

Laboratory Manual , Cold Spring Harbor Laboratory, N.Y. , 
p. 283 (1988). Subsequently, individual clones of cells 
are identified, using known techniques. A preferred 

05 method for identification and isolation of cells makes 
use of nitrocellulose filter overlays, which allow the 
screening of a large number of cells. Cells from the 
library of transfected myeloma cells are seeded in 10 cm 
petri dishes in soft agar (Cook, W.D. and M.D. Scharff , 

10 gNAS, 74:5687 (1977); Paige, C.J., et al. . Methods, in 
Bnzymol. , 150:257 (1987)) at a density of 10 4 colony 
forming units, and allowed to form small colonies 
(approximately 300 cells). A large number of dishes 
• (>100) may be so seeded. Cells are then overlayed with a 

15 thin film of agarose (<lmm) and the agarose is allowed to 
harden. The agarose contains culture medium without 
serum. Nitrocellulose filters (or other protein-binding 
filters) are layered on top of the agarose, and the 
dishes are incubated overnight. During this time, 

20 antibodies secreted by the cells will diffuse through the 
agarose and adhere to the nitrocellulose filters. The 
nitrocellulose filters are keyed to the underlying plate 
and remoyed for processing. 

The method for processing nitrocellulose filters is 

25 identical to the methods used for Western blotting 

(Harlow, E. and D. Lane, Antibodies: Labora tory Manual . 
Cold Spring Harbor, N.Y., p. 283 (1988)). The antibody 
molecules are adsorbed to the nitrocellulose filter. The 
filters, as prepared above, are then blocked. The 

30 desired antigen, for example, keyhole lymphet hemocyanin 
(KLH) , which has been iodinated with radioactive 125 I, is 
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then applied in Vestern blotting buffers to the filters. 
(Other, non radiographic methods can be used for 
detection). After incubation, the filters are vashed and 
dried and used to expose autoradiography film according 

05 to standard procedures. Where the filters have adsorbed 
antibody ■olecules vhich are capable of binding KLH , the 
autoradiography film will be exposed. Cells expressing 
the KLH reactive antibody can be identified by 
deternining the location on the dish corresponding to an 

10 exposed filter; cells identified in this manner can be 
isolated using known techniques. Cells vhich are 
isolated from a region of the dish can then be 
rescreened, to insure the isolation of the clone of 
antigen-binding molecule-producing cells. 

15 Isolatio n of Genes Encoding Antigen-Binding, Molecules of 
Selected Specifi city and Purification of Encoded 
Antigen-B indin g Molecule s 

The gene(s) encoding an antigen-binding molecule of 
selected specificity can be isolated. This can be 

20 carried out, for example, as follows: primers D and C 
(see Figures 2 and 3) are used in a polymerase chain 
reactionf to produce all the heavy chain variable region 
genes introduced into the candidate host cell from the 
library. These genes are cloned again in the framework 

25 vector pFHC at the RE1 and RE2 sites. Similarly, all the 
light chain regions introduced into the host cell from 
the library are cloned into the light chain vector, pFLC. 
Members of the family of vectors so obtained are then 
transformed pairvise into myeloma cells, vhich are tested 

30 for the ability to produce and secrete the antibody with 
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the desired selectivity. Purification of the antibody 
from these cells can then be accomplished using standard 
procedures (Johnstone, A. and R. Thorpe, Immunochem. in 
Practice , Blackwell Scientific, Oxford, p. 27 (1982); 
05 Harlow, E. and D. Lane, Antibodies: A Laboratory Manual , 
Cold Spring Harbor Laboratory, N.Y., p. 283 (1988)). 

Alteration of Affinity of Anti gen- Binding Molecules 
It is also possible to produce antigen-binding 
molecules whose affinity for a selected antigen is 

10 altered (e.g., different from the affinity of a 

corresponding antigen-binding molecule produced by the 
present method). This can be carried out, for example, 
to increase the affinity of an antigen-binding molecule 
by randomly mutagenizing the genes isolated as described 

15 above using previously-described mutagenesis methods. 
Alternatively, the variable region of antigen-binding 
molecule-encoding genes can be sequenced and site 
directed mutagenesis performed to mutate the comple- 
mentarity determining regions (CDR) (Rabat, E.A. , 

20 Immunol . . 141 :S 25-36 (1988)). Both processes result in 
production of a sublibrary of genes which can be screened 
for antigen-binding molecules of higher affinity or of 
altered affinity after the genes are expressed in myeloma 
cells . 

25 Alternative Materials and P rocedures _ for Use in the 
Present Method 

In addition to those described above for use in the 
method of the present invention, other materials (e.g., 
starting materials, primers) and procedures can be used 
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in carrying out the method. For example, use of PCR 
technology to clone a large collection of cDNA genes 
encoding variable regions of heavy chains has been 
described above. Although primers from the Cp class were 

05 described as being used in unidirectional nested PCR, the 
present: Invention is not limited to these conditions. 
For example, primers from any of the other heavy chain 
classes (Cy^ f Cj 1$ Cy 2 b 9 Ca for exam P le ) or from light 
chains can be used. Cp was described as of particular 

10 use because of the fact that the entire repertoire of 
heavy ..chain variable regions are initially expressed as 
IgM. Only following heavy-chain class switching are 
these variable regions expressed with a heavy chain of a 
different class (Shimizu, A. and T. Honjo, Cell , 

15 36:801-803 (1984)). In addition, the predominant 
population of B cells in nonimmune spleen cells is 
IgM + -cells (Cooper, M.D. and P. Burrows, In 
Immunoglobuli n Genes . Academic Press, N.Y. p. 1 (1989)). 
Although unidirectional nested PCR amplification is 

20 described above, other PCR procedures, as well as other 
DNA amplification techniques can be used to amplify DNA 
as needed in the present invention. For example, 
bidirectional PCR amplification of antibody variable 
regions can be carried out. This approach requires use 

25 of multiple degenerate 5' primers (Orlandi, R. f et al. . 
Proc. Natl. Acad. Sci. USA. 86:3833 (1989); Sastry, L. , 
et al, . Proc. Rati. Acad. Sci. PSA. 86:5728 (1989)). 
Bidirectional amplification may not pick up the same full 
diversity of genes as can be expected from unidirectional 

30 PCR. 
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In addition, methods of introducing further 
diversity into the antibody library other than the method 
for random mutagenesis utilizing PCR described above can 
be used. Other methods of random mutagenesis, such as 

05 that described by Saobrook, et,al. (Sambrook, J., et al . , 
Molecular Cloning: A Laboratory Manual , Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989)) 
can be used, as can direct mutagenesis of the comple- 
mentarity determining regions (CDRs). 

10 Framework vectors other than one using a mouse Cp 

heavy chain constant region, which contains the C/j 
enhancer and introns and a viral promoter (described 
previously) can be used for , inserting the products of 
PCR. The vectors described were chosen for their 

15 subsequent use in the expression of the antibody genes, 

but any eukaryotic or prokaryotic cloning vector could be 
used to create a library of diverse cDNA genes encoding 
variable regions of antibody molecules. The inserts from 
this vector could be transferred to any number of 

20 expression vectors. For example, other framework vectors 
which include intronless genes can be constructed, as can 
other heavy chain constant regions. In addition to 
plasmid vectors, viral vectors or retroviral vectors can 
be used to introduce genes into myeloma cells. 

25 The source for -antibody molecule mRNAs can also be 

varied. Purified resting B lymphocytes from mouse 
nonimmunized spleen are described above as such a source. 
However, total spleens (immunized or not) from other 
animals 9 including humans, can be used, as can any source 

30 of antibody-producing cells (e.g., peripheral blood, 
lymph nodes, inflammatory tissue, bone marrow). 



WO 91/10737 



PCT/US91/00209 



-27- 

Introduction of H and L chain gene DNA into myeloma 
cells using cotransf ormation by electroporation or 
protoplast fusion methods is described above (Morrison, 
S.L. and V.T. Oi, Adv. Immunol. . 44:65 (1989)). However, 

05 any means by which DNA can be introduced into living 
cells in vivo can be used, provided that it does not 
significantly interfere, with the ability of the 
transformed cells to express the introduced DNA. In 
fact, a method other than cotransf ormation , can be used. 

10 Cotransf ection was chosen for its simplicity, and because 
both the H and L chains can be Introduced into myeloma 
cells. It may be possible to introduce only the H chain 
into myeloma cells. Moreover, the H chain itself in many 
cases carries sufficient binding affinity for antigen. 

15 However , other methods can also be used. For example, 
retroviral infection may be used. Replication-incompe- 
tent retroviral vectors can be readily constructed which 
can be packaged into infective particles by helper cells 
(Mann, R. , et al. . Cell, 33:153-159 (1903)). Viral 

20 titers of 10^ infectious units per ml. can be achieved, 
making possible the transfer of very large numbers of 
genes, into myeloma cells. 

Further increases in the diversity of antibody- 
producing cells than results from the method described 

25 above can be generated if light and heavy chain genes are 
introduced separately into myeloma cells. Light chain 
genes can be introduced into one set of myeloma cells 
with one selectable marker, and heavy chains into another 
set of cells with a different selectable marker. Myeloma 

30 cells containing and expressing both H and L chains could 
then be generated by the highly efficient process of 
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polyethylene glycol mediated cell fusion (Pontecorvo, G . , 
Somatic Cell Genetics , 1:397 (1975)). Thus, a method of 
screening diverse libraries of antibody genes using 
aniaal cells is not limited by the number of cells which 
05 can be generated, but by the number of cells which can be 
screened. 

Methods of identifying antigen-binding molecule - 

expressing cells expressing an antigen-binding molecule 

of selected specificity other than the nitrocellulose 

10 filter overlay technique described above can be used. An 

important characteristic of any method is that it be 

useful to screen large numbers of different antibodies. 

With the nitrocellulose filter overlay technique, for 

4 

example, if 300 dishes are prepared and 10 independent 
15 transformed host cells per dish are screened, and if, on 

average, each cell produces ten different antibody 

4 7 
molecules, then 300 x 10 x 3, or about 10 different 

antibodies can be screened at once. However, if the 

antibody molecules can be displayed on the cell surface, 

20 still larger numbers of cells can be screened using 
affinity matrices to pre- enrich for antigen-binding 
cells. There are immortal B cell lines, such as BCL^B^ , 
which will express IgM both on the cell surface and as a 
secreted form (Granowicz, E.S., et al . , J . Immunol . . 

25 125:976 (1980)). If such cells are infected by 

retroviral vectors containing the terminal C/i exons , the 
infected cells will likely produce both secreted and 
membrane bond forms of IgK (Webb, C.F., et al . » J . 
Immunol. , 143:3934-3939 (1989)). Still other methods can 

30 be used to detect antibody production. If the host cell 
* s E - coli , a nitrocellulose overlay is possible, and 
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such methods have been frequently used to detect E . coli 
producing particular proteins (Sambrook, J., et r al . , 
Molecular Cloning: A Laboratory Manual , Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, N.Y. 

05 (1989)). Other methods of detection are possible and one 
in particular, vhich involves the concept of "viral 
coating", is discussed below. 

Viral coating can be used as a means of identifying 
viruses encoding antigen-combining molecules. In this 

10 method, a viral vector is used to direct the synthesis of 
diverse antibody molecules. Upon lytic infection of host 
cells, and subsequent cell lysis, the virus becomes 
"coated" vith the antibody product it directs. That is, 
the antibody molecule becomes physically linked to the 

15 outside of a mature virus particle, vhich can direct its 
synthesis. Methods for viral coating are described 
below. Viruses coated by antibody can be physically 
selected on the basis of their affinity to antigen vhich 
is attached to a solid support. The number of particles 

20 vhich can be screened using this approach is veil in 

9 11 
excess of 10 and it is possible that 10 different 

antibody genes could be screened in this manner. In one 

embodiment, an affinity matrix containing antigen used to 

purify those viruses encoding antibody molecules with 

25 affinity to antigen and which coat the surface of the 
virus vhich encodes those antibodies is used. 

One method of viral coating is as follows: A 
diverse library of bacteriophage X encoding parts of 
antibody molecules that are expressed in infected E. col i 

30 and vhich retain the ability to bind antigens is created, 
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using known techniques (Orlandi, R. , et al . . Proc . .. Natl . 

Acad, Sci. US A , 86:3833 (1989); Huse, W.D., et al., 

Science, 246:1275 (1989); Better. H. , et al . . Science. 

240 :1041 (1988); Skerra, A. and A. Pluckthon, Science. 

05 240:1038 (1988)). Bacteria infected with phage are 

embedded in a thin film of semisolid agar. Greater than 

10^ infected bacteria may be plated in the presence of an 

excess of uninfected bacteria in a volume of 1 ml of agar 

2 

and spread over a 10 cm surface. The agar contains 

10 monovalent antibody "A" (Farham, P.. In Handbook of 

Experim ental Immunology : Immunochem . , Blackvell 
Scientific Publishers, Cambridge, MA, pp. 14.1-14.23 
(1986)), which can bind the X coat proteins and which has 
been chemically coupled to monovalent antibody M B n , which 

15 can bind an epitope on all viral directed antibody 

molecules. Monovalent antibodies are used to prevent the 
crosslinking of viral particles. Upon lytic burst, 
progeny phage particles become effectively cross linked 
to the antibody molecule they encode. Because lysis 

20 occurs in semisolid medium, in which diffusion is slow, 
cross linking between a given phage and the antibody 
encoded by another phage is minimized. A nitrocellulose 
filter (or other protein binding filter) is prepared as 
an affinity matrix by adsorbing the desired antigen. The 

25 filter is then blocked so that no other proteins bind 

nonspeclf ically . The filter is overlayed upon the agar, 
and coated phage are allowed to bind to the antigen by 
way of their adherent antibody molecules. Filters are 
washed to remove nonspeclf ically bound phage. 

30 Specifically bound phage therefore represent phage 
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encoding antibodies with the desired specificity. These 
can now be propagated by reinfection of bacteria. 

Thus the present invention makes it possible to 
produce antigen-binding molecules which, like antibodies 
05 produced by presently-available techniques r bind to a 
selected antigen (i.e., having binding specif ity) . 
Antibodies produced as described can be used, for 
example, to detect and neutralize antigens and deliver 
molecules to antigenic sites. 

10 EXAMPLE I Amplification of IgM HeayyChain Variable 
Region DNA from mRNA 
IgM heavy chain variable DNA is amplified from mRNA 
by the procedure represented schematically in Figure 2. 
In Figure 2, Panel A depicts the relevant regions of the 

15 poly adenylated mRNA encoding the secreted form of the 
IgM heavy chain. In Panel A, S denotes the sequences 
encoding the signal peptide which causes the nascent 
peptide to cross the plasma membrane, a necessary step in 
the processing and secretion of the antibody. V, D and J 

20 derive from separate exons and together comprise the 

variable region. C H l f C H 2, and C H 3 are the three constant 
domains of C/t • "Hinge* encodes the hinge region. C, B 
and Z are oligonucleotide PCR primers used in the 
amplification process. The only constraints on Primers B 

25 *tid Z are that they are complementary to the mRNA, and 
occur in the order shown relative to C. Primer C, in 
addition to being complementary to mRNA, has an extra bit 
of sequence at its 5' end which allows the cloning of its 
PCR product. This is described below. Panel B depicts 

30 the reverse transcript DNA product of the mRNA primed by 
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oligonucleotide Z, with the addition of poly-dC by 
terminal transferase at the 3 r end of the product. Panel 
C depicts the annealing of primer A to the reverse 
transcript DNA represented in Panel B . Primer A contains 

05 the restriction endonuclease site RE1 , with additional 
DNA at its 5' end. The constraints on the RE1 site are 
described in Example 2. Panel D depicts the final double 
stranded DNA PCR product made utilizing primers A and B . 
Panel E depicts the PCR product shown in Panel D annealed 

10 to Primer C. Panel F is a blov up of panel E shoving the 
structure of primer C. Primer C consists of two parts: 
a 3 r part complementary to IgK heavy chain mRNA as shown, 
and a 5' part which contains restriction site RE 2 and 
spacer. Constraints on RE2 are described in Example 2. 

15 Panel G depicts the final double stranded DNA PCR product 
utilizing Primers A and C and the product of the previous 
PCR (depicted in Panel D) as template. The S, V, D , J 
regions are again depicted. 

EXAMPLE 2 Constru ction of Hea vy Chain Fra mewor k Vector 

20 PFHC 

A h&avy chain framework vector, designated pFHC, is 
constructed, using known techniques (See Figure 3). It 
is useful for introducing antibody-encoding DNA into host 
cells, in which the DNA is expressed, resulting in 

25 antibody production. The circular plasmid (above) is 

depicted linearized (below) and its relevant components 
are shown. The neomycin antibiotic resistance gene 
(neo ) is useful for selecting transformed animal cells 
(Sambrook, J., et al . , Molecular Clon ing: A Laboratory 

30 Manual . 2d Ed., Cold Spring Harbor Laboratory Press, Cold 



WO 91/10737 



PCT/US91/00209 



Spring Harbor, NY (1989)). The bacterial replication 
origin and ampicillin antibiotic resistance genes, useful 
respectively, for replication in E. c oli and rendering 
coli resistant to ampicillin, can derive from any number 

05 of bacterial plasmids , including PBR322 (Sanbrook, J . 9 et 

fil- » Mol ecular Clonin gj A Laboratory Manual. 2d Ed, , 

Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
NY (1989)). The Cm enhancer, which derives from the 
intron between exons J and C H 1 of the C/* gene, derives 

10 from any one of the cloned Cp genes (Kawakami, T., et 
£!•• Nuclei c Acids Researc h. 8:3933 (1980); Honjo, T., 
ASB^Sl^^SSHB£l^. 1:^?9 (1983)) and increases levels of 
transcription from antibody genes. LTR contains the 
viral promoter from the Moloney MLV retrovirus DNA 

15 (Mul 1 i gan , R . C . . Experimental Manlpulatlon^ofj^p 
Expression, New York Academic Press, p. 155 (1983)). 
D represents the PCR primer described in the text, 
depicted in its 5' to 3' orientation. The only con- 
straints on D are its orientation, its complementarity to 

20 pFHC and its order relative to the RE1 and RE2 cloning 
sites. Preferably. D is within 100 nucleotides of RE1. 
The cDNA cloning site contains restriction endonuclease 
sites REl- r and RE2, separated by spacer DNA which allows 
their efficient cleavage. The constraints on RE1 and RE2 

25 are described below. The C/i exons. as described in the 
text and literature, direct the synthesis of IgM heavy 
chain. Only part of C H 1 is present, as described below. 
C H 3 is chosen to contain the Cps region which specifies a 
secreted form of the heavy chain ((Kawakami, T. , et al . . 

30 Nucleic_Aclds Research, 8:3933 (1980); Honjo, T. , Ann? 
Rev, Immunol^, 1:499 (1983)). Finally, pFHC contains 
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poly A addition and termination sequences which can be 
derived from the C M gene itself (Honjo, T. , Ann, Rev. 
Immunol. , 1:499 (1983); ICavakami, T. , et al . . Nucleic 
Acids Research, 8:3933 (1980)). One potential advantage 
05 of using the entire Cm gene is that in some host cell 

systems, a membrane bound and secreted form of IgM may be 
expressed (Granowicz, E.S., et al . , J. Immunol . 125:976 
(1980)). 

The plasmid can be produced by combining the 
10 individual components, or nucleic acid segments, depicted 
in Figure 3, using PCR cassett assembly (See below). 
Because the entire nucleotide sequence of each component 
is defined, the entire nucleotide sequence of the plasma 
is defined. 

15 The constraints on RE1 are simple. It should be the 

sole cleavage site on the plasmid for its restriction 
endonuclease. The choice of RE1 can be made by computer 
based sequence analysis (Intelligenetics Suite, Release 
5:35, Intelligenetics). 

20 The constraints on RE2 are more complex. First, it 

must be the sole cleavage site on the plasmid for its 
restriction endonuclease, as described for RE1. 
Moreover , ithe RE2 site must be such that when the PCR 
product is inserted, a gene is thereby created which is 

25 capable of directing the synthesis of a complete IgM 
heavy chain. This limits the choices for RE2, but the 
choices available can be determined by computer based 
sequence analysis. The choices can be determined as 
follows. First, a list of restriction endonucleases that 

30 do not cleave pFHC is compiled (see Table 1). 
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TABLE 1 

Mon-Cuttlng Enzymes for the Mouse Cn Gene 



10 



15 



AatH 


Aha 1 1 


Asel 


Avrll 


Bgll 


BspHI 


BssHII 


BstBI 


Clal 


Dral 


EagI 


EcoRI 


EcoRV 


Fspl 


Hgal 


Hindi 


Hpal 


Kpnl 


Mlul 


Nael 


Narl 


Ndel 


NotI 


Brul 


PaeR7I 


Pvul 


RsrII 


SacII 


Sail 


Seal 


Sfll 


SnaBI 


Spel 


SphI 


Sspl 


StuI 


Tthllll 


Xbal 


Xhol 



These are called the "rare non-cutters . ■ Next, the 
sequence of C^l is rewritten with "N" at the third 
position of each codon and entered into the computer. 
This is called the "N-doped sequence" (See Figure 4). 

20 Next, the rare non-cutters are surveyed by computer 
analysis for those which will cleave the N-doped 
sequence. The search program will show a possible 
restriction endonuclease site, assuming a match between N 
and the restriction endonuclease cutting site. For 

25 example, with 39 rare non-cutters, 22 will cleave the 
N-doped sequence of C/i C H 1, many of them several times 
(see Table 2). In this table, "Def" means a definite cut 
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site, pf which there are none, because of the Ns . "Pos" 
means a possible cleavage site at the indicated nucleo- 
tide position if N is chosen appropriately. "Y" 
indicates any pyrimidine , "R" indicates any purine and 
"N" indicates any nucleotide. The nucleotide positions 
refer to coordinates represented in Figure 4. 
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TABLE 2 
RECOGNITION 


CUT SITE 


Aatll 




lie r. 


: none 


Ahall 






9 SO 




1/6 £ 


z none 


Avrll 




Po s 


247 


( CCTACR ^ 


veZ 


: none 


BspHI 




Po 6 


: 204 


(TCATGA) 


lie £ 


none 


Bsshll 




Pos 


138 

• X. -J w 


(GCGCGC) 


n fl f 


none 


EcoRI 




* U o 


ICQ 




ue x 


: none 


EcoRV 




Pos 




( GATATC^ 


vex 


i none 


Hgal 




Pos 


214 


( GACGCNNNNN 


lie £ 


none 


Hindi 


(NNNNNNNNNNGCGTC) 


Pos 


284 


(GTYRAC^ 


lie z 


J none 


Hpal 




Pa k 

A %J a 


1 ft ^ 

X 0 J 


(GTTAAC \ 


De t 


none 


Kpnl 






9 o n 


CGGTACC^ 


ue i 


none 


Nrul 




Pos 

X Is o 


H V © 


V A w w V w A / 


lie z 


none 


PaeR7 




P rt e 


1 74 

A, / *f 


\ v A w unu y 


ri a f 
lie i 


none 


Pvul 




Pos 


190 


\ w a A w w y 


lie z 


none 


Seal 




x O 5> 


17ft 


( ACTACT^ 


W A < 


none 


Spel^ 




Pos : 


209 


\ftwl AO A ) 


Def : 


none 


SphI 




Pos : 


131 


( GCATGC > 


Def : 


none 


Sspl 




Pos : 


338 


(AATATT) 


Def : 


none 


StuI 




Pos : 


371 


(AGGCCT) 


Def : 


none 


Tthllll 




Pos : 


149 


(GACNNNGTC) 


Def : 


none 


Xbal 




Pos : 


212 


(TCTAGA) 


Def : 


none 


Xhol 




Pos : 


338 


(CTCGAG) 


Def : 


none 






Pos : 


190 



309 



306 



334 



220 



193 
339 

266 
167 



303 



284 
359 



339 



WO 91/10737 



PCT/US91/00209 



-38- 

Most of these cleavage sites (about 60%) are compatible 

with the amino acids specified by C u l. Therefore, it is 

n 

possible to mutate C H 1 to create a unique site for such 
an enzyme without altering the amino acid sequence 
05 incoded by C fi l. One sequence which illustrates this is 
shown below: 

1) . .,ala met gly cys leu ala arg asp... 

2) ...GCC ATG GGC TGC CTA GCC CGG GAC . . . 

3) ...GCC ATG GGC TGC 4 CTA GCG CGC GAC... 

10 BssHIl 

Line 1 represents part of the actual amino acid 
sequence specified by the mouse Cp C^l gene region, and 
line 2 is the actual nucleotide sequence. By changing 
the sequence to the indicated nucleotides underlined on 

15 line 3, a cleavage site for the rare non- cutter BssHIl is 
created. The new sequence (containing the BssHIl site) 
GCG CGC still encodes the identical amino acid sequence. 
Therefore the sequence of the primer C is chosen to be 
the complement of line 3, and RE2 is the BssHIl site. 

20 Such a primer will function in the PCR and vector 

construction as desired. Other examples are possible, 
and the same process can be used in designing vectors and 
primers for cloning light chain variable regions. 

The choice for primer C puts a constraint on pFHC. 

25 In the example shown, the C R 1 region contained on pFHC 
must begin at its 5' end with the mutant sequence GCG 
CGC. Such mutant fragments can be readily made by the 
process of PGR cassette assembly described below. 
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05 



10 



15 



20 



The process of PCR cassette assembly is a method of 
constructing plasmid molecules (in this case the plasmid 
pFHC) from fragments of DNA of known nucleotide sequence. 
One first compiles a list of restriction endonucleases 
that do not cleave any of the fragments. Each fragment 
is then individually PCR amplified using synthesized 
oligonucleotide primers complementary to the terminal 
sequences of the fragment. These primers are synthesized 
to contain on their 5' ends restriction endonuclease 
cleavage sites from the compiled list. Thus, each PCR 
product can be so designed that each fragment can be 
assembled one by one into a larger plasmid structure by 
cleavage and ligation snd transformation into E^_colt . 
Using this method, it is also possible to make minor 
modifications to modify the terminal sequence of the 
fragment being amplified. This is done by altering the 
PCR primer slightly so that a mismatch occurs. In this 
way it is possible to amplify the Cfi gene starting 
precisely from the desired point in C H 1 (as determined by 
oligo C above) and creating the RE2 endonuclease cleavage 
site. 



25 
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CLAIM S 

1. An In .vitro process for synthesizing DNA encoding a 
family of antigen- combining proteins, comprising the 
steps of: 

05 *> obtaining DNA containing genes encoding 

antigen-combining proteins; 

b) combining the DNA containing genes encoding 
antigen- combining proteins with sequence 
specific primers which are oligonucleotides 
homologous to conserved regions of the genes; 
and 

c) performing sequence specific gene 
amplification. 



10 



15 



DNA encoding a family of antigen-combining proteins 
produced by the process of Claim 1. 

3. The process of Claim 1 wherein sequence specific 
gene amplification is performed by the polymerase 
chain reaction. 

4. The process of Claim 3 wherein the sequence specific 
20 primers are bidirectional. 

5. The process of Claim 3 wherein the sequence specific 
primers are nested unidirectional primers. 

6. The process of Claim 1 wherein the antigen- combining 
proteins are immunoglobulins. 
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9. 
10. 
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11. 
12. 

15 

13. 

20 14. 
15. 

25 
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The process of Claim 6 wherein the immunoglobulins 
are selected from the group consisting of heavy 
chains and light chains. 

The process of Claim 7 wherein the heavy chains are 
p chains. 

The process of Claim 1 wherein the DNA containing 
genes encoding antigen-combining proteins is cDNA of 
RNA from antibody-producing cells. 

The process of Claim 1 wherein the DNA containing 
genes encoding antigen-combining proteins is genomic 
DNA from antibody-producing cells. 

The process of Claim 8 wherein the antigen-combining 
proteins are of mammalian origin. 

The process of Claim 1 wherein the primers are 
oligonucleotides homologous to conserved regions of 
the constant regions of immunoglobulin genes. 

J*. 
m ' 

The process of Claim 1 wherein the primers are 
oligonucleotides homologous to the conserved regions 
of the variable regions of immunoglobulin genes. 

The process of Claim 1 wherein the primers contain 
at least one restriction endonuclease cloning site. 

The process of Claim 1 wherein the primers are 
selected from the group consisting of 
oligonucleotide B of Figure 2 and oligonucleotide C 
of Figure 2. 
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16. A method of creating a diverse starter library of 

DNAs encoding families of antigen-combining proteins 
comprising cloning the product of Claim 1 into an 
appropriate vector. 

05 17. a diverse starter library of DNAs encoding families 
of antigen* combining proteins produced by the method 
of Claim 14. 

18. The method of Claim 16 wherein the vector is a 
prokaryotic vector or a eukaryotic vector. 

10 19. The method of Claim 16 wherein the vector is a viral 
vector or a retroviral vector. 

20. The method of Claim 16 wherein the vector is a 
plasmid. 

21. The method of Claim 20 wherein the plasmid is 

15 selected from the group consisting of pFHC and pLHC. 

22. The method of Claim 16 wherein the vector is 
selected from the group consisting of expression 
vectors and cloning vectors. 

23. The method of Claim 22 wherein the expression vector 
20 is appropriate for expression of the variable region 

of an antigen-combining protein as a chimeric 
molecule in register with a framework protein. 

24. The method of Claim 23 wherein the framework protein 
is an immunoglobulin. 
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25. The nethod of Claim 24 wherein the immunoglobulin is 
all or a portion of the constant region of the m 
heavy chain. 

26. The aethod of Claim 16 further comprising creating a 
05 collection of viral particles from viral vector- 
based libraries of DNA encoding antigen-combining 
proteins by the process of introducing viral vectors 
into host cells in which they replicate and form 
viral particles. 

10 27. A method of producing a high diversity library of 

DNA encoding families of antigen-combining proteins 
comprising mutagenizing the product of Claim 16. 

28. A high diversity library of DNA encoding families of 
antigen-combining proteins produced by the method of 

15 Claim 27. 

29. The method of Claim 27 wherein mutagenizing is 
carried out by random chemical mutagenesis. 

30. The "method of Claim 27 wherein mutagenizing is 
carried out by performing the polymerase chain 

20 reaction under limiting nucleotide conditions. 

31. The method of Claim 27 wherein mutagenizing is 
carried out in such a manner that mutagenesis is 
limited to DNA encoding variable regions of the 
antigen-combining protein. 

25 32. A process of producing a diverse population of host 
cells which comprises introducing into host cells 
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DNA of tbe starter library or high diversity 
libraries of antigen-combining proteins. 

33. Host cells produced by the method of Claim 32. 

34. The process of Claim 32 vherein the host cells are 
05 prokaryotic. 

35. The process of Claim 32 vherein the host cells are 
eukaryotic. 

36. The process of Claim 35 Vherein the host cells are 
selected from the group consisting of immortalized 

10 cultured mammalian cells. 

37. The process of Claim 36 vherein the immortalized 
cultured mammalian cells are selected from the group 
consisting of myelomas and plasmacytomas. 

38. The process of Claim 32 vherein the libraries 

15 encoding families of antigen-combining proteins are 

introduced into host cells by a method selected from 
the" group consisting of: electroporation, calcium 
phosphate coprecipitation , protoplast fusion, viral 
infection, and'cell fusion. 

20 39. The process of Claim 32 vherein the libraries of 

DNAs encoding families of antigen-combining proteins 
Is contained in an expression vector. 



25 



40. 



The process of Claim 32 vherein the DNAs encoding 
families of antigen-combining proteins encode 
antigen-combining proteins selected from the group 
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consisting of immunoglobulin heavy chain variable 
regions or immunoglobulin light chain variable 
regions . 



41. The process of Claim 40 vherein DNAs encoding 
05 immunoglobulin heavy chain variable regions are 

introduced simultaneously vith or sequentially to 
DNAs encoding immunoglobulin light chain variable 
regions. 

42. The method of Claim 32 further comprising 

10 identifying cells which produce antigen-combining 

molecules of selected specificity. 

43. The method of Claim 42 wherein identifying of cells 
which produce antigen-combining molecules of 
selected specificity is carried out by assaying 

15 cellular supernatants for antigen- combining 

activity. 



44. The method of Claim 42 wherein identifying of cells 
which produce antigen-combining molecules of 
selected specificity is carried out by a 

20 nitrocellulose filter overlay technique. 

45. The method of Claim 44 wherein cells producing 
antigen-combining molecules of selected specificity 
are enriched for cells producing antigen-combining 
molecules on their surface by affinity matrix 

25 chromatography . 



46. Cells produced by the method of Claim 42. 
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47. Antigen* combining molecules produced by cells of 
Claim 42. 

48. DNAs encoding immunoglobulin heavy chain variable 
regions or immunoglobulin light chain variable 

05 regions, present in cells of Claim 42. 

49. Viruses produced by the method of Claim 26. 

50. A method of isolating viruses of Claim 49 encoding 
antigen- combining molecules of selected specificity, 
comprising the steps of: 

10 a) infecting host cells with an appropriate virus 

containing DNA encoding antigen-combining molecules; 

b) coating the virus with antigen- combining 
molecules which the virus encodes; and 

c) subjecting the product of step (b) to 

15 affinity-matrix selection, to separate the virus 

according to the antigen- combining molecules they 
contain:'* 

51. Viruses produced by the method of Claim 50. 

52. Antigen- combining molecules encoded by viruses of 
20 Claim 51. 
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19 heavy chomond/or light choin 
•n E. coli vector 



Increase diversity of libraries v.o 
random mutagenesis (optionol) 



Transfect libraries into cultured 
cells, where they are expressed 



Identify cu 
expressing Ab of 


itured cells 
desired specificity 

1 


I 




Isolate gene(s) encoding Ab of 
desired specificity and express 
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Table IV Oligonuc leotides used 
SYNLIB1 : 



SYNLIB2 : 

SYNLIB4 : 

SYNLIB5 : 

SYNLIB6 : 

SYNLIB7 : 

SYNLIB8 : 

SYNLIB9 : 

SYNLIB10 

SYNLIB11 

SYNLIB12 

JHSAL : 

CDRFOR : 
CDRBACK 



5'GCC TCC ACC TCT CGA GAC GGT GAC CAG GGT ACC TTG 
SoCoJ ATA GTC AAA (A/CNN) 5 TCT TGC ACA GTA ATA 
CAC GGC CGT GTC-3' 

5'GCC TCC ACC TCT CGA GAC GGT GAC CAG GGT ACC WG 
GCC CCA (A/CNN) 5 TCT TGC ACA GTA ATA CAC GGC CGT 
GTC-3 ' 

5 '-GAC CAG GGT ACC TTG GCC CCA ((A/C)NN)4 TCT TGC 
ACA GTA ATA CAC GGC CGT GTC-3' 

5 '-GAC CAG GGT ACC TTG GCC CCA ( ( A/C )NN ) 5 TCT TGC 
ACA GTA ATA CAC GGC CGT GTC-3 ' 

5' -GAC CAG GGT ACC TTG GCC CCA ((A/C)NN)6 TCT TGC 
ACA GTA ATA CAC GGC CGT GTC-3' 

5--GAC CAG GGT ACC TTG GCC CCA ((A/C)NN)7 TCT TGC 
ACA GTA ATA CAC GGC CGT GTC-3' 

5' -GAC CAG GGT ACC TTG GCC CCA ((A/C)NN)8 TCT TGC 
ACA GTA ATA CAC GGC CGT GTC-3' 

5'-GAC CAG GGT ACC TTG GCC CCA ((A/C)NN)9 TCT TGC 
ACA GTA ATA CAC GGC CGT GTC-3' 

5 '-GAC CAG GGT ACC TTG GCC CCA ((A/C)NN)10 TCT TGC 
ACA GTA ATA CAC GGC CGT GTC-3' 

5«-GAC CAG GGT ACC TTG GCC CCA ((A/C)NN)11 TCT TGC 
ACA GTA ATA CAC GGC CGT GTC-3' 

5' -GAC CAG GGT ACC TTG GCC CCA ((A/C)NN)12 TCT TGC 
ACA GTA ATA CAC GGC CGT GTC-3' 

5'- GCC TGA ACC GCC TCC ACC AGT CGA CAC GGT GAC 
CAG GGT ACC TTG GCC CCA-3* 

5'- CAG GGT ACC TTG GCC CCA-3* 

5'- GTG TAT TAC TGT GCA AGA-3 ' 



Human VH Back Primers 



HuVHlaBACKSfi 
HuVH 2 aBACKS fx 
HuVH3aBACKSfi 
HuVH4aBACKSfi 
HuVH5aBACKSfi 
HuVH6aBACKSfi 



5 '-GTC CTC GCA ACT GCG GCC CAG CCG GCC ATG GCC CAG 
GTG CAG CTG GTG CAG TCT GG-3' 

5^-GTC CTC GCA ACT GCG GCC CAG CCG GCC ATG GCC CAG 
GTC AAC TTA AGG GAG TCT GG-3' 

5' -GTC CTC GCA ACT GCG GCC CAG CCG GCC ATG GCC GAG 
GTG CAG CTG GTG GAG TCT GG-3' 

1^-GTC CTC GCA ACT GCG GCC CAG CCG GCC ATG GCC CAG 
GTG CAG CTG CAG GAG TCG GG-3' 

5 '-GTC CTC GCA ACT GCG GCC CAG CCG GCC ATG GCC CAG 
GTG CAG CTG TTG CAG TCT GC-3 r 

5'-GTC CTC GCA ACT GCG GCC CAG CCG GCC ATG GCC CAG 
GTA CAG CTG CAG CAG TCA GG-3' 
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CLAIMS : 

1. A method of obtaining a member of a specific binding 
pair (sbp member), the sbp member being an antibody or 
antibody fragment and having an antigen binding site with 

5 binding specificity for an antigen which is a self 

antigen of a species of mammal, the method comprising: 

(a) providing a library of replicable genetic 
display packages (rgdps), each rgdp displaying at 
its surface an sbp member , and each rgdp containing 

10 nucleic acid with sequence derived from said species 

of mammal and encoding a polypeptide chain which is 
a component part of the sbp member displayed at the 
surface of that rgdp; 

(b) selecting, by binding with said self antigen, 
15 one or more sbp members with binding specificity for 

said self antigen. 

2. A method according to claim 1 wherein said providing 
a library of rgdps comprises: 

combining (i) a first polypeptide chain component 
20 part of an sbp member fused to a component of a rgdp 
which thereby displays said first polypeptide chain 
component part or population thereof at the surface of 
rgdps on expression in a recombinant host cell organism, 
or a population of such a first polypeptide chain 
25 component part fused to a said component of a rgdp, with 
(ii) a second polypeptide chain component part of an sbp 
member or a population of such a second polypeptide chain 
component part, to form a library of sbp members 
displayed at the surface of rgdps; 
30 at least one of said first or second polypeptide 

chain component part or populations thereof being encoded 
by nucleic acid which is capable of being packaged using 
said component of an rgdp. 

3 # A method according to claim 1 werein said providing 
35 a library of rgdps comprises: 

combining (i) nucleic acid which encodes a first 
polypeptide chain component of an sbp member fused to a 
component of a rgdp or a population of such a first 
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polypeptide chain component: part fused to a component of 
a rgdp, with (11) nucleic acid encoding a second 
polypeptide chain component part of an sbp member or a 
population thereof, to form a library of nucleic acxd, 
5 nucleic acid of said library being capable of bexng 
packaged using said component of an rgdp; 

expressing in a recombinant host organism saxd first 
polypeptide chain component part fused to a component of 
a rgdp or population thereof and said second polypeptide 

10 chain component part of an sbp member or a population 

thereof, to produce a library of rgdps each displaying at 
its surface an sbp member and containing nucleic acid 
encoding a first and a second polypeptide chain component 
part of the sbp member displayed at its surface. 

15 4. A method according to claim 1, 2 or 3 wherein each 
said sbp member displayed at the surface of an rgdp 
antibody fragment comprising a V H domain and a V L domain. 
5 A method according to claim 2 wherein both saxd 
first and second polypeptide chain component parts or 

20 populations thereof are expressed from nucleic acxd 
capable of being packaged using said component of an 

6 9dP " A method according to any preceding claim wherein 
each said sbp member displayed at the surface of an rgdp 
25 is an scFv antibody fragment. 

" 7. A method according to claim 2 or claim 3 wherexn 
said second polypeptide chain component part or 
population thereof is encoded by nucleic acid separate 
from nucleic acid encoding said first polypeptide chaxn 

30 component part or population. 

~ " 8 A method according to claim 1 or claim 7 wherexn 

each said sbp member displayed at the surface of an rgdp 
is an Fab antibody fragment;. 

9 A method according to any one of the preceding 
35 claims wherein the nucleic acid is derived from 
rearranged V genes of an unimmunised mammal. 
10. A method according to any one of claims 1 to 8 
wherein the nucleic acid is derived from a library 
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prepared by artificial or synthetic recombination of v- 
gene sequences. 

11. a method according to claim 10 wherein the library 
is derived from germ line V-gene sequences. 
5 12. A method according to any one of the preceding 
claims wherein said species of mammal is human. 

13. A method according to any one of the preceding 
claims wherein sbp members selected in (b) displayed at 
the surface of rgdps are selected or screened to provide 

10 an individual rgdp displaying an sbp member or a mixed 
population of said rgdps, with each rgdp containing 
nucleic acid encoding the sbp member or a polypeptide 
chain thereof which is displayed at its surface. 

14. A method according to any one of the preceding 

15 claims wherein nucleic acid which encodes a selected or 
screened sbp member and which is derived from an rgdp 
which displays at its surface a selected or screened sbp 
member is used to express an sbp member or a fragment or 
derivative thereof in a recombinant host organism. 

20 15. A method according to claim 14 wherein nucleic acid 
from one or more rgdps is taken and used to provide 
encoding nucleic acid in a further method to obtain an 
individual sbp member or a mixed population of sbp 
members, or encoding nucleic acid therefor. 

25 16. A method according to claim 14 or claim 15 wherein 
the expression end product is modified to produce a 
derivative thereof. 

17. A method according to any one of claims 14,15 and 16 
wherein the expression end product or derivative thereof 

30 is used to prepare a therapeutic or prophylactic 
medicament or a diagnostic product. 

18. Use, in a method according to any one of the 
preceding claims, of a kit comprising a library of 
nucleic acid sequences capable of being packaged in rgdps 

35 and which encodes a polypeptide chain component part of 
an antobody for display at the surface of rgdps. 

19. Use, in a method according to any one of claims 1 to 
17, of a kit comprising a library of rgdps each 
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containing nucleic acid encoding at least one polypeptide 
chain component part of an antibody. 
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